
patch: Update timeout for failover to 190s #47

Merged
skourta merged 2 commits into 9/edge from patch-failover-timeout
Apr 21, 2026

Conversation

Contributor

@skourta skourta commented Apr 19, 2026

This pull request adjusts the retry logic for the coordinated primary failover process to better accommodate the expected duration of a failover operation. The main change is to increase the total wait time and interval between retries, ensuring the process aligns with the 3-minute timeout used by Sentinel.

Failover process reliability:

  • Increased the number of retry attempts from 5 to 19 and the wait interval between retries from 2 seconds to 10 seconds in the failover_primary_coordinated method in src/common/client.py, allowing up to 3 minutes and 10 seconds for the failover to complete, matching Sentinel's timeout behavior.
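The retry budget described above can be checked with a little arithmetic. The sketch below is illustrative only: `retry_budget` is a hypothetical helper, not code from src/common/client.py. It reports two readings of the same settings, because whether a wait follows the final attempt depends on how the retry loop is written.

```python
def retry_budget(attempts: int, wait_s: int) -> dict[str, int]:
    """Total sleep time (seconds) for a fixed-wait retry policy, under two readings.

    Hypothetical helper for illustration; not part of the charm's code.
    """
    if attempts < 1 or wait_s < 0:
        raise ValueError("attempts must be >= 1 and wait_s must be >= 0")
    return {
        # Sleeps happen only between consecutive attempts.
        "waits_between_attempts": (attempts - 1) * wait_s,
        # One sleep is counted for every attempt, including the last.
        "wait_per_attempt": attempts * wait_s,
    }


old = retry_budget(attempts=5, wait_s=2)
new = retry_budget(attempts=19, wait_s=10)
print("old settings:", old)
print("new settings:", new)
```

Counting one wait per attempt, the new settings give 190 s, the 3 minutes and 10 seconds quoted in the description. Counting waits only between attempts, they give 180 s, which equals the 3-minute Sentinel timeout before adding the time spent in the checks themselves.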

@skourta skourta marked this pull request as ready for review April 21, 2026 07:58

Copilot AI left a comment


Pull request overview

This PR adjusts the Tenacity retry configuration used when polling Sentinel to determine whether a coordinated primary failover is still in progress, aiming to better align the polling window with Sentinel’s failover timeout expectations.

Changes:

  • Increased retry attempts in SentinelClient.is_failover_in_progress from 5 to 19.
  • Increased the fixed wait between retries from 2s to 10s.
  • Added inline comments describing the intended overall wait behavior.
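As a rough illustration of the polling pattern summarized in these changes, the following sketch mimics a fixed-wait, bounded-attempt retry using only the standard library. It is not the PR's Tenacity configuration or the actual `SentinelClient` code: `wait_for_failover`, its parameters, and the injected `is_in_progress` callable are hypothetical names, and the sketch assumes waits occur only between attempts.

```python
import time
from typing import Callable


def wait_for_failover(
    is_in_progress: Callable[[], bool],
    attempts: int = 19,
    wait_s: float = 10.0,
    sleep: Callable[[float], None] = time.sleep,
) -> bool:
    """Poll until the failover is no longer in progress.

    Returns True if the failover finished within the attempt budget,
    False if it was still in progress after the last attempt.
    """
    for attempt in range(1, attempts + 1):
        if not is_in_progress():
            return True
        if attempt < attempts:
            sleep(wait_s)  # assumption: wait only between attempts
    return False


# Demo with a fake check and a fake sleep, so nothing actually blocks.
answers = iter([True, True, False])  # in progress twice, then finished
slept: list[float] = []
finished = wait_for_failover(lambda: next(answers), sleep=slept.append)
print(finished, slept)
```

Injecting `sleep` keeps the sketch testable without waiting for real time to pass; a real caller would leave it at the default `time.sleep`.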


Comment thread src/common/client.py
Contributor

@reneradoi reneradoi left a comment


We should state this in the docs; otherwise users might complain about how long it takes to remove units.

@skourta skourta merged commit 27429d9 into 9/edge Apr 21, 2026
93 of 97 checks passed
@skourta skourta deleted the patch-failover-timeout branch April 21, 2026 09:33

4 participants